Improved HMM training and scoring strategies with application to accent classification

نویسندگان

  • Levent M. Arslan
  • John H. L. Hansen
چکیده

In this study we propose two methods to improve HMM speech recognition performance. The rst method employs an adjustment in the training stage, whereas the second method employs it in the scoring stage. It is well known that speech recognition system performance increases when the amount of labeled training data is large. However, due to factors such as inaccurate phonetic labeling, end-point detection, and voiced-unvoiced decisions, the labeling procedure can be prone to errors. In this study, we propose a selective hidden Markov Model (HMM) training procedure in order to reduce the adverse innuence of atypical training data on the generated models. To demonstrate its usefulness, selective training is applied to the problem of accent classiication, resulting in a 9.4% improvement in classiication error rate. The second goal is to improve HMM scoring performance. The objective of HMM training algorithms is to maximize the probability over the training tokens for each model. However, this does not guarantee a minimized error rate across the entire model set. Typically, biases in the confusion matrices can be observed. We propose a method for estimating the bias from input training data, and incorporating it into the general scoring algorithm. Using this technique, a 9.8% improvement is achieved in accent classiication error rate.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improved Automatic Clustering Using a Multi-Objective Evolutionary Algorithm With New Validity measure and application to Credit Scoring

In data mining, clustering is one of the important issues for separation and classification with groups like unsupervised data. In this paper, an attempt has been made to improve and optimize the application of clustering heuristic methods such as Genetic, PSO algorithm, Artificial bee colony algorithm, Harmony Search algorithm and Differential Evolution on the unlabeled data of an Iranian bank...

متن کامل

Effect of education on the knowledge and attitude of intensive care unit staff towards the use of predictive disease severity scoring systems

Background and Purpose: Severity of illness scoring systems is used for the classification of patients to receive medical services, predict the risk of mortality, determine hospital bed occupancy, and assess treatment progress. In Iran, these scoring systems are not frequently used due to the lack of knowledge of medical staff. This study aimed to evaluate the effect of education on the knowl...

متن کامل

Beating Henry Higgins at His Own Game: A Markovian Approach to Dialectology

1. Introduction The performance of speech recognition algorithms degrades considerably due to speaker variability. Aside from gender, the largest cause for speaker variability is accent. If the accent of a speaker can be determined automatically, then accent-specific speech recognition models can be used, thereby increasing speech recognition accuracy. In this study, the problem of accent class...

متن کامل

Acoustic Modeling of Accented English Speech for Large-vocabulary Speech Recognition

In this paper, we present a study on robust speech recognition with respect to accent variations. Differences that characterize accents in speech can be divided into two parts: phonetic and acoustic. We focus on the acoustic differences and the ways of acoustic model design and training that can be used to minimize the effect of accent variations on the speech recognition system’s performance. ...

متن کامل

Acoustic model selection for recognition of regional accented speech

Accent is cited as an issue for speech recognition systems [1]. Research has shown that accent mismatch between the training and the test data will result in significant accuracy reduction in Automatic Speech Recognition (ASR) systems. Using HMM based ASR trained on a standard English accent, our study shows that the error rates can be up to seven times higher for accented speech, than for stan...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996